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The  World  Bank 

Open  Source  Tools  for  Transport  Planning 

Indicator  Calculation  Methodology 

The  calculation  methods  outlined  in  this  document  provide  examples  of  how  to  measure  indicators 
using  a spatially  enabled  SQL  database.  These  examples  do  not  necessarily  prescribe  the  exact  approach 
that  full-fledged  software  will  take,  but  provide  a proof-of-concept  and  logical  structure  for  calculations. 
With  more  robust  software,  there  will  likely  be  performance  enhancements  related  to  memory  caching 
and  multithreading  that  can  be  taken  advantage  of. 

Number  of  modes  and  types 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  Route  types 

Discussion:  The  route  types  table  contains  friendly  names  of  the  types  of  transit  used  in  the  system.  This 
query  joins  the  description  to  the  routes  table,  counts  the  number  of  individual  routes,  and  groups  by 
the  route  type. 

SQL: 

SELECT  t. description,  COUNT(r.*)  as  numlines  FROM  gtfs_routes  r 
JOIN  gtfs_route_types  t ON  r.route_type  = t.route_type 
GROUP  BY  t .description; 

Transit  system  length  in  km 

Required  Data:  GTFS 

Tables  Accessed:  Shapes  or  Stop  times 

Discussion:  The  shapes.txt  GTFS  file  is  optional,  as  is  the  stop_times  shape_distance_traveled  field.  This 
indicator  will  not  be  available  for  GTFS  data  that  includes  neither  shapes  nor  stop  distance  traveled. 

This  series  of  queries  creates  and  spatially  enables  two  new  tables  in  the  database.  The  shapelines  table 
contains  a line  for  each  shapejd  from  the  shapes.txt  table.  Subsequent  queries  calculate  line  length  and 
group  by  route,  mode,  and  system. 

SQL: 

--This  creates  a polyline  table  with  the  local  UTM  projection 
CREATE  TABLE  shapelines  (shape_id  VARCHAR,  len_ft  NUMERIC); 

SELECT  AddGeometryColumn  ( ' shapelines ' , 'geom',  32619,  ' LINESTRING ' , 2); 

--Add  a local  projection  column  to  the  points 

SELECT  AddGeometryColumn  ( ' gtf s_shapes ' , 'geom',  32619,  'POINT',  2); 
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UPDATE  gtfs_shapes  SET  geom  = ST_Transform(ST_SetSRID(ST_MakePoint(shape_pt_lon, 
shape_pt_lat ) , 4326 ) , 32619 ) ; 

--Builds  the  lines 

INSERT  INTO  shapelines  (SELECT  shape_id,  0.0,  ST_MakeLine(ST_SnapToGrid(geomj  1.0) 
ORDER  BY  shape_pt_sequence)  as  newgeom  FROM  gtfs_shapes  GROUP  BY  shape_id); 

UPDATE  shapelines  SET  len_metens  = ST_Length(geom); 


--  length  by  mode  if  there's  a shapes.txt 
SELECT  descriptionj  SUM(avg_len)  as  modelength 
FROM 

(SELECT  r.route_id,  AVG(s . len_meters/1000)  as  avg_lenj  r.route_type 
FROM  shapelines  s 

JOIN  gtfs_trips  t ON  s.shape_id  = t.shape_id 
JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  r.route_id,  s . len_meters , r.route_type 
) as  lengths 

JOIN  gtfs_route_types  USING( route_type) 

GROUP  BY  description 

--  if  no  shapes.txt  and  stop_distance_traveled  is  populated 

SELECT  descriptionj  SUM(avg_len)  as  modelength 

FROM 

(SELECT  r.route_idj  AVG(s . maxstopdist)  as  avg_lenj  r.route_type 

FROM  (SELECT  trip_idj  MAX(shape_dist_traveled)/1000  AS  maxstopdist 

FROM  gtfs_stop_times  GROUP  BY  trip_id)  s 

JOIN  gtfs_trips  t ON  t.trip_id  = s.trip_id 

JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

GROUP  BY  r.route_idj  r.route_type 

) as  lengths 

DOIN  gtfs_route_types  USING( route_type) 

GROUP  BY  description 


Number  of  stops 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  and  stop  times 

Discussion:  This  query  counts  the  number  of  stops  per  route  or  mode. 
SQL: 

--By  Route 
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SELECT  route_idj  MAX(numben)  as  num_stops 
FROM 

(SELECT  t.tnip_idj  r.route_id,  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_id 
) as  route_stops 
GROUP  BY  route_id; 

--By  Mode 

SELECT  gtf s_route_types .description,  COUNT(*) 

FROM 

(SELECT  st.stop_id,  r . route_typej  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  st.stop_id,  r.route_type 
) as  route_stops 

JOIN  gtfs_route_types  USING( route_type) 

GROUP  BY  gtf s_route_types . description; 

Weekly  number  of  hours  of  service 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  stop  times,  calendar,  calendar  dates,  frequencies 
Discussion:  The  GTFS  definition  recommends  that  stop  times  after  midnight  not  wrap  around  the  day 
change  (going  from  23:59  to  00:00),  but  instead  continue  above  the  24-hour  mark.  This  calculation 
depends  on  the  data  following  this  recommendation. 

The  SQL  query  below  only  gets  hours  of  service  for  Monday.  The  query  will  be  repeated  for  each  day  of 
the  week,  and  the  duration  for  each  day  can  be  summed  to  get  the  weekly  number  or  hours  of  service. 
The  query  controls  for  some  routes  being  operated  on  a frequency  basis. 

SQL: 

--For  individual  routes 

--  Duration  of  Service  (for  a single  weekday) 

--  By  route 

SELECT  mins . route_id , maxs. maxtime  - mins.mintime  as  duration  FROM 
(SELECT  MIN(mintime)  as  mintime,  route_id,  route_type  FROM 

(SELECT  MIN(CAST( SUBSTRING (a rrival_time  FROM  0 FOR  3)  AS  INTEGER))  as  mintime, 
r.route_id,  r.route_type 
FROM  gtfs_stop_times  st 
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JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
JOIN  gtfs_routes  n on  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday  = 1)  AND  NOT 
EXISTS  (SELECT  gtf s_f nequencies .trip_id  FROM  gtf s_f requencies  WHERE  st.trip_id  = 
gtfs_f requencies . trip_id) 

GROUP  BY  r.route_idj  n.route_type 

UNION 

SELECT  MIN(CAST (SUBSTRING(stant_time  FROM  0 FOR  3)  AS  INTEGER))  as  mintime j r.route_idj 

n. route_type 

FROM  gtfs_f requencies  f 

JOIN  gtfs_trips  t ON  f.trip_id  = t.trip_id 
JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday  = 1) 

GROUP  BY  r.route_idj  r.route_type 

) as  f req_stop_mins  GROUP  BY  route_idj  route_type)  as  mins 
JOIN 

(SELECT  MAX(maxtime)  as  maxtimej  route_idj  route_type  FROM 

(SELECT  MAX(CAST(SUBSTRING(arrival_time  FROM  0 FOR  3)  AS  INTEGER))  as  maxtimej 
r.route_idj  r.route_type 
FROM  gtfs_stop_times  st 

JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday  = 1)  AND  NOT 
EXISTS  (SELECT  gtf s_f requencies .trip_id  FROM  gtf s_f requencies  WHERE  st.trip_id  = 
gtfs_f requencies . trip_id) 

GROUP  BY  r.route_idj  r.route_type 

UNION 

SELECT  MAX(CAST(SUBSTRING(end_time  FROM  0 FOR  3)  AS  INTEGER))  as  maxtimej  r.route_idj 

r . route_type 

FROM  gtfs_f requencies  f 

JOIN  gtfs_trips  t ON  f.trip_id  = t.trip_id 
JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday  = 1) 

GROUP  BY  r.route_idj  r.route_type 

) as  f req_stop_maxs  GROUP  BY  route_idj  route_type)  as  maxs 
ON  mins . route_id  = maxs . route_id; 

--By  mode 

SELECT  route_types . description j MAX ( maxs . maxtime)  - MIN(mins .mintime)  as  duration 
FROM 
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(SELECT  mintimej  route_idj  route_type  FROM 

(SELECT  MIN (CAST( SUBSTRING (a mi val_time  FROM  0 FOR  3)  AS  INTEGER))  as  mintimej 
n.route_idj  r.route_type 

FROM  gtfs_stop_times  st 

30IN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
JOIN  gtfs_routes  r on  r.route_id  = t.noute_id 

WHERE  t.senvice_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday 
= 1)  AND  NOT  EXISTS  (SELECT  gtf s_f nequencies .trip_id  FROM  gtfs_f requencies  WHERE 
st.trip_id  = gtfs_frequencies.tnip_id) 

GROUP  BY  r.route_idj  n.route_type 
UNION 

SELECT  MIN(CAST(SUBSTRING(stant_time  FROM  0 FOR  3)  AS  INTEGER))  as  mintimej  r.route_idj 

r . route_type 

FROM  gtfs_f requencies  f 

JOIN  gtfs_trips  t ON  f.trip_id  = t.trip_id 
JOIN  gtfs_routes  r ON  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday  = 1) 

GROUP  BY  r.route_idj  r.route_type 

) AS  f req_stop_mins  GROUP  BY  route_idj  route_typej  mintime)  as  mins 

JOIN 

(SELECT  maxtimej  route_idj  route_type  FROM 

(SELECT  MAX(CAST(SUBSTRING(arrival_time  FROM  0 FOR  3)  AS  INTEGER))  as  maxtimej 
r.route_idj  r.route_type 

FROM  gtfs_stop_times  st 

JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday 
= 1)  AND  NOT  EXISTS  (SELECT  gtf s_f requencies .trip_id  FROM  gtf s_f requencies  WHERE 
st.trip_id  = gtfs_frequencies.trip_id) 

GROUP  BY  r.route_idj  r.route_type 
UNION 

SELECT  MAX(CAST (SUBSTRING(end_time  FROM  0 FOR  3)  AS  INTEGER))  as  maxtimej  r.route_idj 

r . route_type 

FROM  gtfs_f requencies  f 

JOIN  gtfs_trips  t ON  f.trip_id  = t.trip_id 
JOIN  gtfs_routes  r ON  r.route_id  = t.route_id 

WHERE  t.service_id  IN  (SELECT  service_id  FROM  gtf s_calendar  WHERE  monday  = 1) 

GROUP  BY  r.route_idj  r.route_type 

) AS  f req_stop_maxs  GROUP  BY  route_idj  route_typej  maxtime)  as  maxs 
ON  mins. route  id  = maxs. route  id 
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JOIN 

(SELECT  * FROM  gtf s_route_types ) as  route_types  ON  mins . route_type  = 

route_types . route_type 

GROUP  BY  route_types .description; 

Ratio  of  number  of  stops  to  route-length 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  shapes  or  stop  times 

Discussion:  There  are  two  ways  to  derive  the  length  of  a route,  as  shown  above.  The  below  queries  rely 
on  either  a shapes.txt  file  or  the  stop_distance_traveled  field  in  the  stops_times.txt  field. 

SQL: 

SELECT  lenavgs . route_id,  stopscount . num_stops  / lenavgs . routelength  AS  coverage 
FROM 

(SELECT  route_id,  route_type,  AVG(len_meters/1000)  as  routelength 
FROM 

(SELECT  s.shape_id,  s . len_meterSj  t.route_id,  r.route_type 
FROM  shapelines  s 

JOIN  gtfs_trips  t ON  s.shape_id  = t.shape_id 

JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

GROUP  BY  s.shape_id,  s . len_meters , t.route_id,  r.route_type 

) as  lengths 

GROUP  BY  route_id,  route_type)  as  lenavgs 
JOIN 

(SELECT  route_id,  MAX(number)  as  num_stops 
FROM 

(SELECT  t.trip_id,  r.route_idj  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_id,  r.route_id 
) as  route_stops 

GROUP  BY  route_id)  as  stopscount 
ON  lenavgs . route_id  = stopscount . route_id; 


--by  mode 

SELECT  gtf s_route_types .description,  stopssum. numbstops  / lenavgs .modelength  AS 

coverage 

FROM 

(SELECT  route_type,  SUM(avg_len)  as  modelength 
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FROM 

(SELECT  r.route_idj  AVG(s . len_meters/1000)  as  avg_lenj  r.route_type 
FROM  shapelines  s 

JOIN  gtfs_trips  t ON  s.shape_id  = t.shape_id 
JOIN  gtfs_noutes  r on  r.route_id  = t.noute_id 
GROUP  BY  r.route_idj  s . len_meters , r.route_type 
) as  lengths 

GROUP  BY  route_type)  as  lenavgs 
JOIN 

(SELECT  route_typej  SUM(num_stops)  as  numbstops 
FROM 

(SELECT  route_idj  route_typej  MAX(numben)  as  num_stops 
FROM 

(SELECT  t.trip_idj  r.route_idj  route_typej  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_idj  route_type 
) as  route_stops 

GROUP  BY  route_typej  route_id)  as  stopscount 
GROUP  BY  route_type)  as  stopssum 
ON  lenavgs . route_type  = stopssum. route_type 

JOIN  gtfs_route_types  ON  lenavgs . route_type  = gtf s_route_types . route_type; 

--for  when  there's  no  shapes.txt 

SELECT  lenavgs . route_idj  stopscount . num_stops  / lenavgs . routelength  AS  coverage 
FROM 

(SELECT  r.route_idj  AVG(s . maxstopdist)  as  routelengthj  r.route_type 

FROM  (SELECT  trip_idj  MAX(shape_dist_traveled)/1000  AS  maxstopdist 

FROM  gtfs_stop_times  GROUP  BY  trip_id)  s 

JOIN  gtfs_trips  t ON  t.trip_id  = s.trip_id 

JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

GROUP  BY  r.route_idj  r.route_type 

) as  lenavgs 

JOIN 

(SELECT  route_idj  MAX(number)  as  num_stops 
FROM 

(SELECT  t.trip_idj  r.route_idj  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_id 
) as  route_stops 

GROUP  BY  route_id)  as  stopscount 
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ON  lenavgs . route_id  = stopscount. route_id; 

--by  mode 

SELECT  gtf s_route_types .description,  stopssum. numbstops  / lenavgs .modelength  AS 

coverage 

FROM 

(SELECT  route_typej  SUM(routelength)  as  modelength 
FROM 

(SELECT  r.route_idj  AVG(s . maxstopdist)  as  routelength,  r.route_type 

FROM  (SELECT  trip_id,  MAX(shape_dist_traveled)/1000  AS  maxstopdist 

FROM  gtfs_stop_times  GROUP  BY  trip_id)  s 

JOIN  gtfs_trips  t ON  t.trip_id  = s.trip_id 

JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

GROUP  BY  r.route_id,  r.route_type 

) as  lengths 

GROUP  BY  route_type)  as  lenavgs 
JOIN 

(SELECT  route_typej  SUM(num_stops)  as  numbstops 
FROM 

(SELECT  route_idj  route_typej  MAX(number)  as  num_stops 
FROM 

(SELECT  t.trip_id,  r.route_id,  route_type,  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_id,  r.route_id,  route_type 
) as  route_stops 

GROUP  BY  route_typej  route_id)  as  stopscount 
GROUP  BY  route_type)  as  stopssum 
ON  lenavgs . route_type  = stopssum. route_type 

JOIN  gtfs_route_types  ON  lenavgs . route_type  = gtf s_route_types . route_typej 

Average  distance  between  stops 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  shapes  or  stop  times 

Discussion:  Similar  to  the  above  indicator,  there  are  two  ways  to  calculate  this  indicator,  depending  on 
data  available. 

SQL: 

--  works  if  shape_dist_traveled  is  populated 
--  By  Route 

SELECT  lenavgs . route_id,  lenavgs . routelength  / stopscount . num_stops  AS  coverage 
FROM 
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(SELECT  r.noute_idj  AVG(s . maxstopdist)  as  routelengthj  n.route_type 

FROM  (SELECT  trip_idj  MAX(shape_dist_traveled)/1000  AS  maxstopdist 

FROM  gtfs_stop_times  GROUP  BY  trip_id)  s 

JOIN  gtfs_tnips  t ON  t.trip_id  = s.tnip_id 

JOIN  gtfs_noutes  r on  r.route_id  = t.route_id 

GROUP  BY  r.route_idj  n.route_type 

) as  lenavgs 

JOIN 

(SELECT  route_idj  MAX(numben)  as  num_stops 
FROM 

(SELECT  t.trip_idj  r.route_idj  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_id 
) as  route_stops 

GROUP  BY  route_id)  as  stopscount 
ON  lenavgs . route_id  = stopscount . route_id; 

--By  Mode 

SELECT  gtf s_route_types . description j lenavgs .modelength  / stopssum. numbstops  AS 

coverage 

FROM 

(SELECT  route_typej  SUM(routelength)  as  modelength 
FROM 

(SELECT  r.route_idj  AVG(s . maxstopdist)  as  routelengthj  r.route_type 

FROM  (SELECT  trip_idj  MAX(shape_dist_traveled)/1000  AS  maxstopdist 

FROM  gtfs_stop_times  GROUP  BY  trip_id)  s 

JOIN  gtfs_trips  t ON  t.trip_id  = s.trip_id 

JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

GROUP  BY  r.route_idj  r.route_type 

) as  lengths 

GROUP  BY  route_type)  as  lenavgs 
JOIN 

(SELECT  route_typej  SUM(num_stops)  as  numbstops 
FROM 

(SELECT  route_idj  route_typej  MAX(number)  as  num_stops 
FROM 

(SELECT  t.trip_idj  r.route_idj  route_typej  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_idj  route_type 


Azavea 


11 


rev  1:  25  March  2013 


The  World  Bank:  OSTTP  - Indicator  Calculation  Methodology 


•lazavea 


) as  route_stops 

GROUP  BY  route_typej  route_id)  as  stopscount 
GROUP  BY  route_type)  as  stopssum 
ON  lenavgs . route_type  = stopssum. route_type 

JOIN  gtfs_noute_types  ON  lenavgs . route_type  = gtf s_noute_types . route_type; 

--otherwise,  need  to  use  shape  length 
--  By  Route 

SELECT  lenavgs . route_idj  lenavgs . noutelength  / stopscount . num_stops  AS  coverage 
FROM 

(SELECT  route_idj  route_type,  AVG(len_meters/1000)  as  routelength 
FROM 

(SELECT  s.shape_idj  s . len_meterSj  t.route_idj  r.route_type 
FROM  shapelines  s 

JOIN  gtfs_trips  t ON  s.shape_id  = t.shape_id 

JOIN  gtfs_routes  r on  r.route_id  = t.route_id 

GROUP  BY  s.shape_idj  s . len_meters , t.route_idj  r.route_type 

) as  lengths 

GROUP  BY  route_idj  route_type)  as  lenavgs 
JOIN 

(SELECT  route_idj  MAX(number)  as  num_stops 
FROM 

(SELECT  t.trip_idj  r.route_id,  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  10IN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  10IN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_id 
) as  route_stops 

GROUP  BY  route_id)  as  stopscount 
ON  lenavgs . route_id  = stopscount . route_id; 

--By  Mode 

SELECT  gtfs_route_types .descriptionj  lenavgs .modelength  / stopssum. numbstops  AS 

coverage 

FROM 

(SELECT  route_typej  SUM(avg_len)  as  modelength 
FROM 

(SELECT  r.route_idj  AVG(s . len_meters/1000)  as  avg_lenj  r.route_type 
FROM  shapelines  s 

JOIN  gtfs_trips  t ON  s.shape_id  = t.shape_id 
JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  r.route_idj  s . len_meters , r.route_type 
) as  lengths 
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GROUP  BY  route_type)  as  lenavgs 
JOIN 

(SELECT  route_type,  SUM(num_stops)  as  numbstops 
FROM 

(SELECT  route_idj  route_typej  MAX(numben)  as  num_stops 
FROM 

(SELECT  t.tnip_idj  r.route_id,  route_typej  COUNT(*)  as  number 
FROM  gtfs_stop_times  st 

LEFT  JOIN  gtfs_trips  t ON  st.trip_id  = t.trip_id 
LEFT  JOIN  gtfs_routes  r on  r.route_id  = t.route_id 
GROUP  BY  t.trip_idj  r.route_idj  route_type 
) as  route_stops 

GROUP  BY  route_typej  route_id)  as  stopscount 
GROUP  BY  route_type)  as  stopssum 
ON  lenavgs . route_type  = stopssum. route_type 

JOIN  gtfs_route_types  ON  lenavgs . route_type  = gtf s_route_types . route_type; 

Average  weekday  peak,  non-peak,  and  aggregated  time  traveled 
between  stops 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  stop  times,  calendar 

Discussion:  This  indicator  averages  the  time  it  takes  to  get  between  consecutive  stops  along  a route, 
aggregated  at  different  levels.  Including  the  day  of  the  week  and  limits  on  stop  times  narrow  the  focus 
to  particular  blocks  of  time.  The  below  query  is  the  average  without  regard  to  time. 

SQL: 

--  by  route 

SELECT  d.route_id,  AVG( (b . maxtime  - a.mintime)  * 60  / c.numstops)  AS 
avg_min_between_stops 

FROM  (SELECT  trip_id,  MIN(CAST(substring(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60)  as  mintime, 
min(stop_sequence)  as  minseq  FROM  gtfs_stop_times  GROUP  BY  trip_id  ORDER  BY  trip_id)  a 
JOIN  (SELECT  trip_id,  MAX(CAST(substring(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60)  as  maxtime, 
max(stop_sequence)  as  maxseq  FROM  gtfs_stop_times  GROUP  BY  trip_id  ORDER  BY  trip_id)  b 
ON  a.trip_id  = b.trip_id 

JOIN  (SELECT  trip_id,  COUNT(*)  as  numstops  FROM  gtf s_stop_times  GROUP  BY  trip_id  ORDER 
BY  trip_id)  c ON  a.trip_id  = c.trip_id 
JOIN  gtfs_trips  d ON  d.trip_id  = a.trip_id 
GROUP  BY  d . route_id 

--by  mode 
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SELECT  f. description , AVG((b. maxtime  - a.mintime)  * 60  / c.numstops)  AS 
avg_min_between_stops 

FROM  (SELECT  trip_id,  MIN(CAST(substring(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60)  as  mintime, 
min(stop_sequence)  as  minseq  FROM  gtfs_stop_times  GROUP  BY  trip_id  ORDER  BY  trip_id)  a 
JOIN  (SELECT  trip_id,  MAX(CAST(substring(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60)  as  maxtime, 
max(stop_sequence)  as  maxseq  FROM  gtfs_stop_times  GROUP  BY  trip_id  ORDER  BY  trip_id)  b 
ON  a.trip_id  = b.trip_id 

JOIN  (SELECT  trip_id , COUNT(*)  as  numstops  FROM  gtf s_stop_times  GROUP  BY  trip_id  ORDER 

BY  trip_id)  c ON  a.trip_id  = c.trip_id 

JOIN  gtfs_trips  d ON  d.trip_id  = a.trip_id 

JOIN  gtfs_routes  e ON  d.route_id  = e.route_id 

JOIN  gtfs_route_types  f ON  e.route_type  = f.route_type 

GROUP  BY  f .description 

Average  system  weekday  / weekend  frequency 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  stop  times,  calendar,  frequencies 

Discussion:  Compiles  the  headway  for  a specific  stop  and  route  combination,  then  averages  by  route  and 
mode.  Weekend  and  weekday  options  can  be  selected  by  modifying  the  join  to  the  calendar  table.  This 
query  accommodates  GTFS  data  that  relies  on  frequency-based  scheduling  using  the  frequencies.txt 
table. 

SQL: 

--  Average  service  frequency,  by  route. 

--  The  actual  calculation  is  (l/(sum/duration) )*60*60  to  get  from  headway  in  seconds  to 
frequency  in  hours 

--  Formally,  the  headway  is  the  period,  defined  as  the  reciprocal  of  the  frequency. 
SELECT  3600*(srv_end-srv_start)/(sum)  as  buses_per_hr,  route_id  FROM 

(SELECT  SUM(trip_headway_hrs_wghted) , MIN(starttime)  as  srv_start,  MAX(endtime)  as 
srv_end,  route_id  FROM 

(SELECT  (endtime-starttime)*headway_secs  as  trip_headway_hrs_wghted , starttime, 
endtime,  trip_id,  route_id  FROM 

(SELECT  start_time,  end_time,  f reqs .trip_id,  trips . route_id,  headway_secs , 
CAST(substring(start_time  FROM  1 FOR  2)  as  decimal)  + 

CAST(substring(start_time  FROM  4 for  2)  as  decimal)/60  AS  starttime, 
CAST(substring(end_time  FROM  1 FOR  2)  as  decimal)  + 

CAST(substring(end_time  FROM  4 for  2)  as  decimal)/60  AS  endtime 
FROM  gtfs_f requencies  as  freqs 


JOIN 
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(SELECT  trip_idj  service_idj  route_id  FROM  gtfs_tnips)  AS  trips  ON 
trips .trip_id=f reqs . trip_id 

JOIN 


(SELECT  service_idj  mondayj  tuesdayj  wednesdayj  thursdayj  friday  FROM  gtf s_calendar 

WFIERE  monday='l'  OR  tuesday='l'  OR  wednesday= ' 1 ' OR  thursday='l'  OR  friday='l' 
) AS  weekdays 

ON  weekdays. service_id=trips.service_id)  AS  trip_route_start_end_hrs 

) AS  wghted_headways 

GROUP  BY  route_id)  as  agg_components 

UNION  --Connect  stop-based  counting  to  frequency-based  counting 

select  avg(num/duration_hrs)  as  buses_per_hrj  rt_stp_duration . route_id  from 
--Get  the  duration  (and  some  other  stats)  separately 

(select  (max(rt_stp_hrs.arrtime)  - min(rt_stp_hrs.arrtime))  as  duration_hrs , count(*) 
as  nurrij  rt_stp_hrs . route_idj  rt_stp_hrs  . stop_id  from 

--Gives  all  arrivals  for  one  stop  without  regard  to  day. 

(select  distinct  gtfs_stop_times.trip_idj  arrival_timej  departure_timej  stop_idj 
trips . route_idj 

CAST(substring(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60  as  arrtime 
from  gtfs_stop_times 

--Gives  the  trip_id  to  join  the  next  query  on 
join(select  trip_idj  service_idj  route_id  from  gtfs_trips)  as  trips  on 
trips .trip_id=gtf s_stop_times . trip_id 
--Gives  all  trips  which  occur  on  the  specified  days. 
join(select  service_idj  mondayj  tuesdayj  wednesdayj  thursdayj  friday 
from  gtf s_calendar 

where  monday='l'  or  tuesday='l'  or  wednesday= ' 1 ' or  thursday='l'  or 
friday='l')  as  weekdays 
on  weekdays . service_id=trips . service_id 
--Filter  out  anything  that's  in  frequencies 

WFIERE  NOT  EXISTS  (SELECT  gtf s_f requencies .trip_id  FROM  gtfs_f  requencies  WFIERE 
gtfs_stop_times .trip_id  = gtfs_frequencies.trip_id) 

order  by  arrtimej  stop_id)  as  rt_stp_hrs  group  by  rt_stp_hrs . route_idj 
rt_stp_hrs . stop_id 

) as  rt_stp_duration  where  rt_stp_duration . duration_hrs  > 0.05  group  by 
rt_stp_duration . route_id 
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--by  mode 

SELECT  AVG(buses_per_hr) , description  FROM 

((SELECT  3600*(srv_end-srv_start)/(sum)  as  buses_per_hrj  route_id  FROM 


(SELECT  SUM(trip_headway_hrs_wghted) , MIN(starttime)  as  srv_startj  MAX(endtime)  as 
srv_endj  route_id  FROM 

(SELECT  (endtime-starttime)*headway_secs  as  trip_headway_hrs_wghted , starttimej 
endtimej  trip_idj  route_id  FROM 

(SELECT  start_timej  end_timej  f reqs .trip_idj  trips . route_idj  headway_secs , 
CAST(substring(start_time  FROM  1 FOR  2)  as  decimal)  + 
CAST(substring(start_time  FROM  4 for  2)  as  decimal)/60  AS  starttimej 
CAST(substring(end_time  FROM  1 FOR  2)  as  decimal)  + 

CAST(substring(end_time  FROM  4 for  2)  as  decimal)/60  AS  endtime 
FROM  gtfs_f requencies  as  freqs 

JOIN 


(SELECT  trip_idj  service_idj  route_id  FROM  gtfs_trips)  AS  trips  ON 
trips .trip_id=f reqs . trip_id 


JOIN 


(SELECT  service_idj  mondayj  tuesdayj  wednesdayj  thursdayj  friday  FROM  gtf s_calendar 

WHERE  monday='l’  OR  tuesday='l'  OR  wednesday= ' 1 ' OR  thursday='l'  OR  friday='l' 
) AS  weekdays 

ON  weekdays. service_id=trips.service_id)  AS  trip_route_start_end_hrs 

) AS  wghted_headways 

GROUP  BY  route_id)  as  agg_components 

UNION  --Connect  stop-based  counting  to  frequency-based  counting 

select  avg(num/duration_hrs)  as  buses_per_hrj  rt_stp_duration . route_id  from 
--Get  the  duration  (and  some  other  stats)  separately 

(select  (max(rt_stp_hrs.arrtime)  - min(rt_stp_hrs.arrtime))  as  duration_hrs , count(*) 
as  numj  rt_stp_hrs . route_idj  rt_stp_hrs . stop_id  from 

--Gives  all  arrivals  for  one  stop  without  regard  to  day. 

(select  distinct  gtfs_stop_times.trip_idj  arrival_timej  departure_timej  stop_idj 
trips . route_idj 

CAST(substring(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60  as  arrtime 
from  gtfs_stop_times 

--Gives  the  trip_id  to  join  the  next  query  on 
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join(select  trip_idj  service_idj  noute_id  from  gtfs_trips)  as  trips  on 
trips .trip_id=gtf s_stop_times . trip_id 
--Gives  all  trips  which  occur  on  the  specified  days. 
join(select  service_idj  mondayj  tuesdayj  Wednesday,  thursdayj  friday 
from  gtf s_calendar 

where  monday=’l'  or  tuesday='l'  or  wednesday= ' 1 ' or  thursday='l'  or 
friday='l')  as  weekdays 
on  weekdays . service_id=trips . service_id 
--Filter  out  anything  that's  in  frequencies 

WHERE  NOT  EXISTS  (SELECT  gtf s_f requencies .trip_id  FROM  gtfs_f requencies  WHERE 
gtfs_stop_times .trip_id  = gtfs_frequencies.trip_id) 

order  by  arrtimej  stop_id)  as  rt_stp_hrs  group  by  rt_stp_hrs . route_id , 
rt_stp_hrs . stop_id 

) as  rt_stp_duration  where  rt_stp_duration . duration_hrs  > 0.05  group  by 
rt_stp_duration . route_id 
) as  avg_f req_by_rt 

JOIN 

(SELECT  route_idj  route_type  FROM  gtfs_routes)  AS  types  ON  types . route_id  = 
avg_f req_by_rt . route_id 

JOIN 

(SELECT  route_typej  description  FROM  gtfs_route_types)  as  descs  ON 
types . route_type=descs . route_type 
) AS  f req_with_descs 
GROUP  BY  description; 


System  Affordability  as  a Proportion  of  Poverty  Line  Income 

Required  Data:  GTFS,  road  network 
Tables  Accessed:  Fare  attributes,  fare  rules 

Discussion:  Where  fare  information  is  not  included  in  the  GTFS  data,  an  average  one-way  fare  can  be 
during  software  configuration.  The  income  threshold  defining  poverty  will  also  be  collected  during 
configuration.  The  threshold  in  the  United  States  for  an  individual  is  $1 1 ,490 

--  Rides  per  month  = 42,  Fare  = $2.50,  Poverty  line  = $11490  divided  by  12  months 
SELECT  (42  * 2.5)  / (11490  / 12)  * 100; 


Azavea 


17 


rev  1:  25  March  2013 


The  World  Bank:  OSTTP  - Indicator  Calculation  Methodology 


•lazavea 


Ratio  of  Total  Transit  Lines  Length  over  Total  Road  Network  Length 
(RBR) 

Required  Data:  GTFS,  road  network,  regional  boundary 
Tables  Accessed:  Shapes,  roads,  region 

Discussion:  This  set  of  queries  buffers  the  transit  system  shape  by  3m,  then  clips  the  road  network  that 
falls  within  that  buffer.  This  is  the  measure  of  roadway  covered  by  the  transit  system,  which  is  divided 
by  the  total  road  network  length,  each  relative  to  the  region  or  city  boundary.  In  the  software,  this 
sequence  will  need  significant  optimization,  as  the  SQL  performance  is  quite  slow. 

SQL: 

--  First  calculate  the  length  of  road  segments 
ALTER  TABLE  roads  ADD  COLUMN  len_meters  NUMERIC; 

UPDATE  roads  SET  len_meters  = ST_Length (geom) ; 


--  Create  a spatial  index  on  shapelines 

CREATE  INDEX  spidx_shapelines  ON  shapelines  USING  GIST(geom); 


--  Create  a spatial  index  on  roads 

CREATE  INDEX  spidx_roads  ON  roads  USING  GIST(geom); 


--  Takes  a long  time 

CREATE  TABLE  shapesbuffer  AS  SELECT  ST_Union (ST_Buffer (shapelines . geom, 3) ) as  geom  FROM 
shapelines; 

SELECT  DISTINCT 

(SELECT  SUM(len_meters/1000) 

FROM 

(SELECT  ST_Intersection(shapesbufferunion .geom, roads .geom),  len_meters 
FROM  shapesbufferunion , roads 

WHERE  ST_Intersects(shapesbufferunion . geom, roads . geom) ) as  clip)  / 

(SELECT  SUM(len_meters/1000) 

FROM 

(SELECT  ST_Intersection(region . geom, roads .geom) , len_meters 
FROM  region,  roads 

WHERE  ST_Intersects(region.geom,roads.geom))  as  clip) 

FROM  shapelines; 

Transit  Line  Network  Density  (BND) 

Required  Data:  Shapes,  regional  boundary 
Tables  Accessed:  Shapes,  region 
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Discussion:  The  result  of  this  indicator  is  an  index  representing  kilometers  of  transit  lines  over  square 
kilometers  of  area.  The  query  selects  a subset  of  lines  where  they  intersect  the  regional  and  city 
boundaries.  The  line  lengths  are  calculated  in  the  same  way  as  the  above  query. 

SQL: 

--  for  county  boundary 

SELECT  (SELECT  SUM(len_meters)/1000  FROM 

(SELECT  ST_Intersection(shapesbufferunion .geom, roads .geom)  as  geom,  len_meters 
FROM  shapesbufferunion j roads 

WFIERE  ST_Intersects(shapesbufferunion . geom,  roads . geom) ) as  clip,  region 
WHERE  ST_Intersects(region.geom,clip.geom))  / (SELECT  SUM(ST_Area(geom))/1000000  from 
region)  as  BND; 

--  for  city  boundary 

SELECT  (SELECT  SUM(len_meters)/1000  FROM 

(SELECT  ST_Intersection(shapesbufferunion .geom, roads .geom)  as  geom,  len_meters 
FROM  shapesbufferunion j roads 

WFIERE  ST_Intersects(shapesbufferunion . geomj  roads  . geom) ) as  clipj  city 
WHERE  ST_Intersects(city.georfijShapelines.geom))  / (SELECT  SUM(ST_Area(geom)  )/1000000 
from  city)  as  BND; 

Coverage  of  Transit  Stops  for  500  meter  Radius  (CBS_500) 

Required  Data:  Stops,  boundary  files 
Tables  Accessed:  Stops,  region,  city 

Discussion:  The  stops  table  must  first  be  given  a spatial  column,  if  it  does  not  already  have  one.  The 
calculation  unions  the  500m  buffers  of  each  stop,  then  divides  the  area  by  the  total  regional  or  city  area. 
Only  stops  within  the  comparison  area  are  buffered.  If  the  system  has  lines  using  informal  stops,  a 500m 
buffer  of  the  lines  themselves  can  be  used. 

SQL: 

SELECT  AddGeometryColumn  ( ' gtf s_stops ' , 'geom',  2264,  'POINT',  2); 

UPDATE  gtfs_stops  SET  geom  = 

ST_Transform(ST_SetSRID(ST_Makepoint(stop_lon, stop_lat)  ,4326) , 2264); 


SELECT 

(SELECT  ST_Area(STJJnion(ST_Buffer(gtf s_stops . geom,  500) ) ) 

FROM  gtfs_stops,  city 

WHERE  ST_Intersects(city.georrijgtfs_stops.geom))  / (SELECT  SUM(ST_Area(geom) ) from  city) 
as  CBS_500; 


SELECT 

(SELECT  ST_Area(ST_Union(ST_Buffer(gtf s_stops . geom,  500) ) ) 
FROM  gtfs_stops,  region 
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WHERE  ST_Intersects(region.geonijgtfs_stops.geom))  / (SELECT  SUM(ST_Area(geom) ) from 
region)  as  CBS_500; 

--  For  informal  stop  usage 

--  This  will  take  some  time  to  union  the  line  buffer  - in  production  softwarej  this 

buffer  would  be  cached 

SELECT 

(SELECT  ST_Area(ST_Union(ST_Buffer(shapelines . georrij  500) ) ) 

FROM  shapelinesj  city 

WHERE  ST_Intersects(city. georrij  shapelines  .geom) ) / (SELECT  SUM(ST_Area(geom))  from 
city)  as  CBS_500; 


SELECT 

(SELECT  ST_Area(ST_Union(ST_Buffer(  shapelines . georrij  500) ) ) 

FROM  shapelinesj  region 

WHERE  ST_Intersects(region.geomj  shapelines . geom) ) / (SELECT  SUM(ST_Area (geom) ) from 
region)  as  CBS_500; 

Ratio  of  the  Transit-Pattern  Operating  Suburban  Lines  (STR) 

Required  Data:  GTFS,  city  boundary 

Tables  Accessed:  Stops,  trips,  routes,  stop  times,  city 

Discussion:  This  query  calculates  the  ratio  of  the  number  of  routes  with  stops  outside  the  urban  center 
of  the  system  to  the  total  number  of  routes. 

SQL: 

SELECT  CAST (COUNT ( *)  as  DECIMAL)  / CAST((SELECT  COUNT(*)  FROM  gtfs_routes)  AS  DECIMAL) 
FROM 

(SELECT  t.route_id  FROM  gtfs_trips  t 

JOIN  gtfs_stop_times  st  ON  st.trip_id  = t.trip_id 

WHERE  st. stop_id  IN  (SELECT  stop_id  FROM  gtfs_stops,  city  WHERE  NOT 
ST_Contains( city. geom,  gtfs_stops .geom) ) 

GROUP  BY  t.route_id)  as  suburbanites; 

On-Time  Performance  (for  low  frequency-  service) 

Required  Data:  GTFS,  Real-time  analysis  results 
Tables  Accessed:  Routes,  trips,  stop  times,  calendar 

Discussion:  This  indicator  is  calculated  using  the  real-time  analysis  results,  and  measures  the  average 
deviation  from  schedule  time  per  route  and  mode.  Additional  metrics  (standard  deviation,  z-scores,  etc.) 
may  be  desirable  once  data  is  available  and  sample  results  calculated. 

Regularity  of  Headways  (for  high-frequency  service) 

Required  Data:  GTFS,  Real-time  analysis  results 

Tables  Accessed:  Routes,  trips,  stop  times,  calendar,  frequencies 
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Discussion:  This  indicator  is  calculated  using  the  real-time  analysis  results,  and  measures  the  average 
deviation  from  scheduled  frequency  per  route  and  mode.  Additional  metrics  (standard  deviation,  z- 
scores,  etc.)  may  be  desirable  once  data  is  available  and  sample  results  calculated. 


Dwell  Time  Performance 

Required  Data:  GTFS,  Real-time  analysis  results 
Tables  Accessed:  Routes,  trips,  stop  times,  calendar 

Discussion:  This  indicator  is  calculated  using  the  real-time  analysis  results,  and  measures  the  average 
deviation  from  scheduled  dwell  time  per  route  and  mode.  Additional  metrics  (standard  deviation,  z- 
scores,  etc.)  may  be  desirable  once  data  is  available  and  sample  results  calculated.  Dwell  time  is  a 
somewhat  unreliable  measure  in  the  real-time  analysis  results,  and  indicator  results  should  be 
understood  in  this  context. 

Travel  Time  Performance 

Required  Data:  GTFS,  Real-time  analysis  results 
Tables  Accessed:  Routes,  trips,  stop  times,  calendar 

Discussion:  This  indicator  is  calculated  using  the  real-time  analysis  results,  and  measures  the  average 
deviation  from  scheduled  time  between  stops  per  route  and  mode.  Additional  metrics  (standard 
deviation,  z-scores,  etc.)  may  be  desirable  once  data  is  available  and  sample  results  calculated. 


Average  Service  Frequency  (ASF)  within  city  and  within  a bounded 
area 

Required  Data:  GTFS 

Tables  Accessed:  Routes,  trips,  stop  times,  calendar 

Discussion:  This  indicator  is  calculated  in  the  same  fashion  as  average  weekday  / weekend  frequency. 

System  coverage  (city  and  bounded  area) 

Required  Data:  Stops,  boundary  files 
Tables  Accessed:  Stops,  region,  city 

Discussion:  The  calculation  for  this  query  is  identical  to  that  of  CBS500,  with  the  exception  that  the 
buffer  distance  may  be  variable. 

System  accessibility  (city  and  bounded  area) 

Required  Data:  Population  data,  GTFS 
Tables  Accessed:  Stops,  population 

Discussion:  This  query  is  a template  for  selecting  a subset  of  the  population  and  setting  the  buffer 
distance  of  transit  stops.  In  this  case,  it  is  the  total  population  measured  in  2012  within  500m  of  a stop. 
SQL: 
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SELECT  CAST(subpop.subpopsum  AS  DECIMAL)  / CAST(totalpop . totalpopsum  AS  DECIMAL)  as 
ratio 

FROM  (SELECT  SUM(pop2012)  as  subpopsum  FROM  pop, (SELECT 

ST_Union(ST_Buffer(gtf s_stops . geom, 500) ) as  geom  FROM  gtfs_stops)  as  coverage  WFIERE 
ST_Intersects(coverage.geom,  pop. geom))  AS  subpopj 
(SELECT  SUM(pop2012)  as  totalpopsum  FROM  pop)  as  totalpop; 

System  accessibility  for  low-income  residents  (city  and  bounded  area) 

Required  Data:  Population  data,  GTFS 
Tables  Accessed:  Stops,  population 

Discussion:  Similar  to  the  above  indicator,  this  query  is  a template  to  compare  a subset  of  a population 
to  overall  population  served  by  the  transit  system.  Configurable  items  include  the  subset  of  the 
population  to  choose,  as  well  as  the  buffer  distance  from  stops. 

SQL: 

SELECT  CAST(subpop. subpopsum  AS  DECIMAL)  / CAST(totalpop . totalpopsum  AS  DECIMAL)  as 
ratio 

FROM  (SELECT  SUM( low_income)  as  subpopsum  FROM  pop, (SELECT 

ST_Union(ST_Buffer(gtf s_stops . geom, 500) ) as  geom  FROM  gtfs_stops)  as  coverage  WHERE 
ST_Intersects(coverage.geom,  pop. geom))  AS  subpop, 

(SELECT  SUM( pop2012)  as  totalpopsum  FROM  pop, (SELECT 
ST_Union(ST_Buffer(gtf s_stops . geom, 500 

))  as  geom  FROM  gtfs_stops)  as  coverage  WHERE  ST_Intersects(coverage.geom,  pop. geom)) 
as  totalpop; 

Service  frequency,  weighted  by  served  population  (city  and  bounded 
area) 

Required  Data:  GTFS,  population 

Tables  Accessed:  Routes,  trips,  stop  times,  population 

Discussion:  This  query  weights  the  frequency  of  each  stop,  per  route,  with  the  population  served  (within 
500m).  The  frequency  is  counted  for  each  individual  served,  and  the  average  is  calculated  by  dividing  by 
the  sum  of  population. 

SQL: 

select  sum(freq_wt_pop)/sum(stoppop)  as  pop_weighted_f req_per_hr  from 
(select  (stop_freq.freq*pop_served_stop.stoppop)  as  freq_wt_pop, 
pop_served_stop . stoppop,  stop_f req . stop_id  from 

(select  (stop_durtn_counts . num/stop_durtn_counts . duration_hrs)  as  freq, 
stop_durtn_counts . stop_id  from 

(select  count(*)  as  num,  (max(stop_trip_times.arrtime)- 
min(stop_trip_times.arrtime))  as  duration_hrs,  stop_id  from 

--Gives  all  arrivals  for  one  stop  without  regard  to  day. 

(select  distinct  gtfs_stop_times.trip_id,  arrival_time,  departure_time,  stop_id. 
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CAST(substning(arrival_time  from  1 for  2)  as  decimal)  + 
cast(substring(arrival_time  from  4 for  2)  as  decimal)/60  as  arrtime 
from  gtfs_stop_times 

--Gives  the  trip_id  to  join  the  next  query  on 
join( 

select  trip_idj  service_id  from  gtfs_trips 
) as  trips  on  trips. trip_id=gtfs_stop_times.trip_id 
--Gives  all  trips  which  occur  on  the  specified  days. 
join( 

select  service_idj  monday,  tuesdayj  wednesdayj  thursday,  friday 
from  gtfs_calendar 

where  monday='l’  or  tuesday='l'  or  wednesday= ' 1 ' or  thursday=’l'  or  friday='l' 

) as  weekdays 

on  weekdays . service_id=trips . service_id 
order  by  arrtime 

) as  stop_trip_times  group  by  stop_id 

) as  stop_durtn_counts  where  stop_durtn_counts .duration_hrs  > 0.05 
) as  stop_freq 

JOIN 

(SELECT  gtf s_stops . stop_id , SUM(pop. pop2012)  as  stoppop 
FROM  gtfs_stops 
INNER  30IN  pop 

ON  ST_Intersects(ST_Buffer(gtfs_stops.geomj500)j  pop.geom) 

GROUP  BY  gtf s_stops . stop_id 
) as  pop_served_stop 

ON  pop_served_stop . stop_id=stop_f req . stop_id 
) as  weight_pop; 

Job  accessibility 

Required  Data:  GTFS,  population  data,  job  data 
Tables  Accessed: 

Discussion:  This  indicator  requires  network  traversal  to  be  calculated  accurately,  and  cannot  be 
calculated  simply  in  SQL.  The  approach  to  calculating  this  indicator  is  to  set  up  a series  of  job  locations  as 
destination  / origins,  calculate  the  travel  time  from  each  job  origin,  and  then  select  and  sum  the 
population  that  falls  into  the  composite  travel  shed.  Since  the  technology  architecture  to  calculate  this 
indicator  is  as  of  yet  undetermined,  a full  example  is  temporarily  deferred. 

Access  Index 

Required  Data: 

Tables  Accessed: 
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Discussion:  This  indicator  is  currently  undefined. 
SQL: 


Boston  Example  Results 

Number  of  modes  and  types 

description  | numlines 


Bus  | 197 

Intercity  Rail  | 12 

Underground  Rail  | 8 

Ferry  | 3 

Street  Level  Rail  | 11 

Transit  System  Length  in  kilometers 

description  | modelength 


Bus  | 10036.37838040307908800000 
Intercity  Rail  | 1534.6089044789766000 
Underground  Rail  | 194.6697746270275200 
Street  Level  Rail  | 211.4597392782880600 


Number  of  Stops  per  Route 


route_id 

1 

num_stops 

131 

1 

49 

505 

1 

28 

CR-Fitchburg 

1 

18 

94 

1 

40 

275 

1 

2 

79 

1 

21 

351 

1 

24 

07 

1 

19 

CR-Greenbush 

1 

10 

38 

1 

43 

222 

1 

61 

240 

1 

82 

34 

1 

45 

880_ 

1 

20 

68 

1 

15 
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436 

1 61 

439 

| 50 

43 

| 20 

221 

1 39 

47 

1 39 

32 

| 41 

9702 

| 22 

882_ 

1 16 

75 

1 27 

217 

1 85 

Number  of  stops  per  mode 
description  [ count 


+ 

Bus  | 

7696 

Intercity  Rail  | 

137 

Underground  Rail  | 

102 

Ferry  | 

6 

Street  Level  Rail  | 

150 

Weekly  number  of  hours  in  service  - these  results  are  for  Monday  only 
By  route: 


route_id 

1 

duration 

70A 

1 

15 

751 

1 

19 

50 

1 

14 

CR-Middleborough 

1 

18 

326 

1 

13 

708 

1 

14 

503 

1 

13 

112 

1 

14 

111 

1 

21 

238 

1 

18 

434 

1 

12 

Logan-33 

1 

11 

4050 

1 

1 

Boat-Fl 

1 

17 

80 

1 

20 

By  mode: 

description 

1 

duration 
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Bus 

1 

22 

Intercity  Rail 

1 

21 

Underground  Rail 

1 

22 

Ferry 

1 

17 

Street  Level  Rail 

1 

23 

Number  of  stops  per  kilometer 


route_id 

j coverage 

01 

| 3.7502145377915875 

04 

| 3.7175188317261752 

05 

| 5.2922152564834317 

07 

| 4.3793751226126449 

08 

| 5.6597016491916552 

09 

| 5.3970313635213175 

10 

| 6.1959845609160915 

100 

| 4.3392329903062473 

101 

| 6.2014935175235907 

104 

| 4.1668256815217900 

105 

| 4.2772058399081693 

106 

| 5.6742376700781739 

108 

| 5.9761372413109610 

109 

| 5.5054643279267203 

11 

| 5.1108343091188745 

110 

| 4.8847664659694873 

111 

| 3.9747968793396159 

Number  of  stops  per  kilometer  (by  mode) 
description  | coverage 


Street  Level  Rail  | 1.0262000735488507 
Underground  Rail  | 0.55992256737768206987 
Intercity  Rail  | 0.09969966911663780589 
Bus  | 0.73811485769256951194 


Average  distance  between  stops  (by  route) 


route_id  | coverage 

— + 


01 

| 0.26665141151867976071 

04 

| 0.26899661985993617647 

05 

| 0.18895678870486448276 
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07 

| 0.22834307909285026316 

08 

| 0.17668775882255642157 

09 

| 0.18528704627492575333 

10 

| 0.16139485019183901042 

100 

| 0.23045547501919772105 

101 

| 0.16125147872432585714 

104 

| 0.23999084109388140625 

105 

| 0.23379749243526465116 

106 

| 0.17623512763190672245 

108 

| 0.16733216785708121489 

Average  distance  between  stops  (by  mode) 


description  | coverage 


Street  Level  Rail  | 0.97446884460040580645 
Underground  Rail  | 1.7859612351103442 
Intercity  Rail  | 10.0301235586861216 
Bus  | 1.35480269713864458531 


Average  time  between  consecutive  stops  on  a trip  (in  minutes) 
description  | avg_min_between_stops 


+ 


Bus 

Intercity  Rail 
Underground  Rail 
Ferry 

Street  Level  Rail 


0.98250231267564764220 
5 . 78788857109534004145 
2 . 00569748113414382000 
9 . 01971726190476190483 
1 . 55378384223877357018 


Average  system  weekday  / 
buses_per_hr 


27.72509500897147623842 
12.99294354838709677417 
9.87217323397889808466 
48 . 00000000000000000038 
2.26797135454872592716 
9 . 76009605694933908543 
11 . 54450507141879846706 
5.03589183017887235715 
1.10735932707482292745 
13.50818971879160263960 
1.59104181729788717646 


peak  / weekend  frequency 
I route  id 


39 

171 

428 

9701 

CR- Lowell 
116 
70 
236 
913_ 

45 

CR-Worcester 
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10.36648362816475108598 

1 

117 

1.43611786269894734362 

1 

CR-Fitchburg 

7.70438552112994282449 

1 

100 

25.33451839914779233583 

1 

77 

3.64219653389170785362 

1 

439 

9 . 10478444081802356020 

1 

41 

1.25825246947865956589 

1 

Boat-F3 

7.44912312243229237423 

1 

94 

1.33467710707008739501 

1 

CR-Franklin 

8 . 53178877806942060774 

1 

27 

19.75025675839450091293 

1 

22 

6.55024726385309147265 

1 

18 

13.84694918990679130721 

1 

43 

5.39294337150375765559 

1 

436 

System  Affordability  as  a Proportion  of  Poverty  Line  Income 
10.97% 


Ratio  of  Transit  Length  to  Road  Length 
0.20958106432611699287 


Transit  Line  Network  Density 
Region:  0.0112457816953249 
City:  5.41088829044538 

Coverage  of  Transit  Stops  for  500  meters  Radius  (CBS_500) 
City:  1.06696659280942 
Region:  0.00352030475304973 


Ratio  of  the  Transit-Pattern  Operating  Suburban  Lines  (STR) 
0.68398268398268398268 


Percent  of  working-age  population  living  within  . 5 km  of  a transit  stop 
0.28810685020257480195 


Percent  of  sub-population  within  . 5 km  of  a transit  stop 
0.23048821881903212983 


Average  Service  Frequencyj  weighted  by  population 
17.03596367385869195058 
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Chicago  Example  Results 

Number  of  modes  and  types 
description  | numlines 


Bus  | 126 

Underground  Rail  | 8 


Transit  System  Length  in  kilometers 
description  | modelength 


Bus  | 9162.03636687290311700000 

Underground  Rail  | 620.2784371682611500 


route_id  | route_type  | count 

+ — — + 

route_id  | num_stops 


+ 

48  | 

57 

2 1 

45 

X98  | 

2 

135  | 

30 

94  | 

89 

79  | 

99 

63W  | 

35 

4 I 

106 

Y I 

6 

Pink  | 

37 

8A  | 

60 

205  | 

51 

82  | 

99 

34  | 

57 

68  | 

38 

157  | 

56 

43  | 

32 

5 | 107 


Number  of  stops  per  mode 
description  | count 
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Bus  | 11298 

Underground  Rail  | 296 

Weekly  number  of  hours  in  service  - these  results  are  for  Monday  only 
By  route: 

route_id  | duration 


50 

+ 

1 

20 

112 

1 

19 

111 

1 

19 

12 

1 

24 

314 

1 

21 

52A 

1 

20 

125 

1 

13 

80 

1 

24 

206 

1 

13 

85A 

1 

16 

9 

1 

25 

18 

1 

15 

51 

1 

16 

146 

1 

19 

53A 

1 

21 

86 

1 

17 

3 

1 

24 

91 

1 

19 

155 

1 

24 

Brn 

1 

25 

192 

1 

13 

35 

1 

21 

67 

1 

24 

63 

1 

24 

By  mode: 

description  | duration 


Bus  | 24 

Underground  Rail  | 24 

Number  of  stops  per  kilometer 
route_id  | coverage 


1 | 4.9938786919840467 
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10 

1.5646665884871703 

100 

5.9471060023054493 

103 

7.3265884031246463 

106 

4.4146123354051465 

108 

5.9045979835421426 

11 

4.9164065352113857 

111 

4.8793578310134675 

111A 

4.6760616956989990 

112 

6.9034594878388405 

115 

5.0111795980952442 

119 

5.2338399167791543 

12 

7.4172671444257571 

120 

3.3765359167102014 

121 

2.5882109326739186 

124 

4.4005484171337238 

125 

4.4145275520274712 

126 

7.7109473663606479 

132 

2.0711225424436801 

134 

2.0889753903850233 

135 

2.5033015794028763 

136 

2.4077353515506738 

143 

2.9510555774170951 

146 

4.1103437617742000 

Number  of  stops  per  kilometer  (by  mode) 

description  | coverage 

+- - 

Underground  Rail  [ 0.40626916058931238300 
Bus  | 0.80255084192703922177 

Average  distance  between  stops  (by  route) 

route_id  | distance 
— --+ 


48 

0.619872514619883 

2 

1.22084107744108 

X98 

25.948 

135 

1.32462629107981 

94 

0.668003554230685 

79 

0.508957523208976 

63W 

0.667007322929168 

4 

0.614729313393039 
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Y 

8.67699999999999 

Pink 

2.71516216216215 

8A 

0.602781193693695 

205 

0.8031686637618 

82 

0.605614202222893 

34 

0.581336370457717 

68 

0.634233760075867 

157 

0.623228416149068 

43 

0.626381913716813 

5 

0.596820427236315 

156 

0.588036610008319 

47 

0.563274876499646 

75 

0.591964285714286 

76 

0.589265438165443 

70 

0.612624919406834 

103 

0.544735833998403 

Average  distance  between  stops  (by  mode) 

description  | distance 

- — -+ 

Underground  Rail  | 3.09614382997897 
Bus  | 0.69755568756576 

Average  time  between  consecutive  stops  on  a trip  (in  minutes) 


By  route: 

route_id  | 

avg_min_between_stops 

+ 


48 

0.63584763254938693535 

2 

1.32155197248780136481 

X98 

12.99999999999999999990 

135 

1.62249841788452764655 

94 

0.66022541386438415069 

79 

0.72102027260736935337 

63W 

0.55514400167251353856 

4 

0.70323066165563254681 

Y 

4 . 00000000000000000000 

Pink 

1.66963871087582427788 

8A 

0.56853512978375241792 

205 

0.83544419208483461187 

82 

0.72980171977688700760 

34 

0.56334214210680942328 

68 

0.51031616279294297870 
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157 

| 1.00040082728926043118 

43 

| 0.71312714639309085449 

5 

| 0.44626168224299065421 

156 

| 0.97125599952462106789 

47 

| 0.68226065370483834319 

75 

| 0.62483577782085244772 

76 

| 0.77085729252957964941 

70 

| 0.72584190474715369023 

103 

| 0.48644756412739546500 

By  mode: 

description  | avg_min_between_stops 


Bus  | 0.75333144128802291409 

Underground  Rail  | 2.04041373993074064064 


Average  system  weekday  frequency  (buses  per  hour) 
By  route: 


buses_per_hr 

1 

route_id 

8.88700040769585157355 

1 

85 

9 . 08481596592020447795 

1 

75 

3 . 73048660490879002847 

1 

100 

11.23325320312689201950 

1 

55 

5.16289197525055587360 

1 

108 

4.95406680275577568474 

1 

95W 

8.73706888544796794151 

1 

157 

8.72996545277827868018 

1 

121 

7.20820497658871123797 

1 

86 

9.76016390288920069430 

1 

53A 

19 . 69030486304387848275 

1 

134 

11.09534473168975347087 

1 

22 

7.97929770076150466602 

1 

115 

8.08144589603739069920 

1 

111 

6.26287396793551092437 

1 

30 

6.95568847243399742586 

1 

28 

5 . 04109589041095890411 

1 

Y 

By  mode: 

avg  | description 


8.64073091203859964676  | Bus 
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6.04372039720269206074  | Underground  Rail 

System  Affordability  as  a Proportion  of  Poverty  Line  Income 
9.87% 


Ratio  of  Transit  Length  to  Road  Length 

0.76056620109906795973  --  This  is  calculated  with  the  city  boundaryj  not  regional 
boundary. 

Transit  Line  Network  Density 
City:  0.35074626865671641791 

Coverage  of  Transit  Stops  for  500  meters  Radius  (CBS_500) 

City:  0.905869514700218 

--  Regional  data  not  available  because  it  is  operated  by  different  agencies  (Pace  and 
Metra) 

Ratio  of  the  Transit-Pattern  Operating  Suburban  Lines  (STR) 

0.35074626865671641791  --  The  CTA  does  not  provide  much  service  outside  Chicago's 
borders;  this  is  provided  by  Pace  and  Metra. 

Percent  of  working-age  population  living  within  . 5 km  of  a transit  stop 
0.58694025909722974378 

Percent  of  sub-population  within  . 5 km  of  a transit  stop 
0.23048821881903212983 

Population  weight  frequency  per  hour,  system-wide: 

11 . 00965975405449364971 

Zhengzhou  Example  Results 

Number  of  modes  and  types 
description  | numlines 


Bus  | 226 

Transit  System  Length  in  kilometers 
description  | modelength 


Bus  | 3019.73899133209 
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Number  of  stops  by  route 


route_id  | num_stops 


289 

+ 

1 

12 

48 

1 

25 

113 

1 

26 

962 

1 

36 

Bll 

1 

28 

2 

1 

38 

B16 

1 

29 

131 

1 

19 

254 

1 

20 

135 

1 

17 

275 

1 

16 

B10 

1 

16 

79 

1 

29 

268 

1 

17 

4 

1 

18 

38 

1 

24 

Number  of  stops  per  mode 
description  | count 

-+ 

Bus  | 7031 

Weekly  number  of  hours  in  service  - these  results  are  for  Monday  only 
By  route: 

route_id  | duration 


50 

+ 

1 

16 

903 

1 

17 

273 

1 

14 

112 

1 

16 

621 

1 

13 

111 

1 

16 

12 

1 

15 

B1A 

1 

18 

272 

1 

14 

203 

1 

17 

58 

1 

17 

125 

1 

11 

Azavea 


35 


rev  1:  25  March  2013 


The  World  Bank:  OSTTP  - Indicator  Calculation  Methodology 


•lazavea 


80 

1 

17 

B20 

1 

17 

206 

1 

16 

628 

1 

15 

18 

1 

15 

By  mode: 

description  | duration 

+ 

Bus  | 24 


Number  of  stops  per  kilometer 
route_id  | coverage 


route_id 

coverage 

50 

1 

1.88251401189407 

903 

1 

1.75573782228987 

273 

1 

3.15135786449161 

621 

1 

1.30832969908417 

112 

1 

1.60732380586324 

111 

1 

1.8308808571235 

12 

1 

1.17311226684394 

B1A 

1 

1.26292000398817 

272 

1 

1.94274612879265 

203 

1 

2.40711262957642 

58 

1 

1.99264255058246 

125 

1 

0.860030101053538 

80 

1 

1.99474716579672 

B20 

1 

1.70614869742109 

206 

1 

2.19178082191781 

628 

1 

1.65584002037957 

18 

1 

1.49142431021625 

Number  of 

stops  per  kilometer  (b 

description 

| coverage 

- + 

Bus  | 1.82267408401811 

Average  distance  between  stops  (by  route) 


route_id  | routeavgstopdist 
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+ 

50 

1 

0.531204545454545 

903 

1 

0.569561119721041 

273 

1 

0.317323529411765 

621 

1 

0.76433333333333 

112 

1 

0.622152173913043 

111 

1 

0.546185185185181 

12 

1 

0.85243333333333 

B1A 

1 

0.791815789473683 

272 

1 

0.514735294117645 

203 

1 

0.415435483870969 

58 

1 

0.501846153846156 

125 

1 

1.16275 

80 

1 

0.50131666666667 

B20 

1 

0.586115384615385 

206 

1 

0.456249999999999 

Average  distance  between  stops  (by  mode) 


description  | coverage 

- +- 

Bus  | 0.548644438832139 


Average  time  between  consecutive  stops  on  a trip  (in  minutes) 


By  route: 


route_id  | avg_min_between_stops 

289 

| 4.56439393939393939394 

48 

| 4.80000000000000000000 

113 

| 4.79986962190352020861 

962 

| 4.86111111111111111111 

Bll 

| 4.82142857142857142857 

2 

| 4.86842105263157894737 

B16 

| 4.82758620689655172414 

131 

| 4.72953216374269005848 

254 

| 4.75000000000000000000 

135 

| 4.67436974789915966387 

275 

| 4.68750000000000000000 

B10 

| 4.67708333333333333333 

79 

| 4.82450738916256157635 

268 

| 4.70588235294117647059 
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4 | 4.71405228758169934641 

By  mode: 

description  [ avg_min_between_stops 

--+ 

Bus  I 4.74547519609168491380 


Average  weekday  frequency 

(stops  per  hour) 

buses_per_hour  | 

route_id 

+ 


9.82578397212543554007 

1 

12.29234199468313180688 

10 

12.06783493499152063313 

100 

12.05555555555555555556 

101 

12.42156108909524654482 

102 

12.35992372477355792150 

103 

12.06451612903225806452 

104 

12.06666666666666666667 

105 

12.06666666666666666667 

106 

12.33932391138273491215 

107 

12.05555555555555555556 

108 

12.51366828131329712378 

109 

12.06896551724137931034 

11 

12 . 07407407407407407407 

111 

12.28481240981240981241 

112 

12 . 49397877984084880637 

113 

12 . 07407407407407407407 

By  mode: 

114 

buses_per_hour 

description 

+ 


12.18434079140008432690  | 

Bus 

Additional  indicators  not 

calculated  due  to  lack  of  road  network  and  population  data 
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